Analyzing 3D Objects in Cluttered Images
نویسندگان
چکیده
We present an approach to detecting and analyzing the 3D configuration of objects in real-world images with heavy occlusion and clutter. We focus on the application of finding and analyzing cars. We do so with a two-stage model; the first stage reasons about 2D shape and appearance variation due to within-class variation (station wagons look different than sedans) and changes in viewpoint. Rather than using a view-based model, we describe a compositional representation that models a large number of effective views and shapes using a small number of local view-based templates. We use this model to propose candidate detections and 2D estimates of shape. These estimates are then refined by our second stage, using an explicit 3D model of shape and viewpoint. We use a morphable model to capture 3D within-class variation, and use a weak-perspective camera model to capture viewpoint. We learn all model parameters from 2D annotations. We demonstrate state-of-the-art accuracy for detection, viewpoint estimation, and 3D shape reconstruction on challenging images from the PASCAL VOC 2011 dataset.
منابع مشابه
Object Recognition and Full Pose Registration in Cluttered Environments
Robust perception is a vital capability for robotic manipulation in unstructured scenes. In this context, full pose estimation of relevant objects in a scene is a critical step towards the introduction of robots into household environments. In this paper, we present an approach for building metric 3D models of objects using local descriptors from several images. Each model is optimized to fit a...
متن کاملAn Integrated System of 3D Pose Estimation and Primitive Robot Actions for Cluttered Manipulation - Motion Planning - Final Report
This project focused on designing and implementing part of a system for detecting and manipulating objects in a cluttered environment. The system, which was proposed by the Manipulation Lab, would use an ABB Robotics IRB 140 robot, a YCB object set, and multiple RGB-D cameras. Using the cameras, the system would perform 3D pose estimation for a subset of the YCB objects when the 3D CAD models a...
متن کاملFusing Color and Geometry Information for Understanding Cluttered Scenes
In this paper, we introduce a new image processing pipeline for scene recognition and pose estimation in robotic applications. Unknown objects are autonomously modeled resulting in geometric 3D models and color images. Theses models are then used for object recognition in cluttered scenes by merging color and geometry information. Our recognition approach generates new suitable feature vectors ...
متن کاملApplication of Shape Analysis on 3D Images - MRI of Renal Tumors
The image recognotion and the classification of objects according to the images are more in focus of interests, especially in medicine. A mathematical procedure allows us, not only to evaluate the amount of data per se, but also ensures that each image is pro- cessed similarly. Here in this study, we propose the power of shape analysis, in conjunction with neural networks for reducing white n...
متن کاملZhile Ren | Research Statement
Figure 1: COG descriptor encodes orientation-invariant gradient feature for objects with different views. I develop new representations and algorithms for three-dimensional (3D) scene understanding from cluttered indoor RGB-D images and outdoor video sequences. I introduce novel representations for 3D object detection systems that localize objects with cuboids and describe room layouts by Manha...
متن کامل